AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
4-bit weight quantization

# 4-bit weight quantization

Gemma 3 12b It Quantized W4A16
Gemma 3 is an instruction-tuned large language model developed by Google. This repository provides its 12B parameter W4A16 quantized version, significantly reducing memory requirements while maintaining good performance.
Large Language Model Transformers
G
abhishekchohan
1,754
3
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase